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DETERMINE THE GALAXY BIAS FACTORS ON LARGE SCALES USING BISPECTRUM METHOD 

H. Guo 1 ' 2 , Y. P. Jing 1 

Draft version July 2, 2009 

ABSTRACT 

We study whether the bias factors of galaxies can be unbiasedly recovered from their power spectra 
and bispectra. We use a set of numerical Y-body simulations and construct large mock galaxy catalogs 
based upon the semi-analytical model of Croton et al. (2006). We measure the reduced bispectra for 
galaxies of different luminosity, and determine the linear and first nonlinear bias factors from their 
bispectra. We find that on large scales down to that of the wavenumber k = 0.1/iMpc -1 , the bias 
factors b\ and & 2 are nearly constant, and b\ obtained with the bispectrum method agrees very well 
with the expected value. The nonlinear bias factor b 2 is negative, except for the most luminous 
galaxies with M r < —23 which have a positive b 2 . The behavior of 62 of galaxies is consistent with the 
62 mass dependence of their host halos. We show that it is essential to have an accurate estimation of 
the dark matter bispectrum in order to have an unbiased measurement of b\ and 62. We also test the 
analytical approach of incorporating halo occupation distribution to model the galaxy power spectrum 
and bispectrum. The halo model predictions do not fit the simulation results well on the precision 
requirement of current cosmological studies. 

Subject headings: gravitational lensing — dark matter — cosmology: theory — galaxies: formation 



1. INTRODUCTION 

In the current ACDM cosmological scenario, dark mat- 
ter dominates the matter density in the universe, and 
luminous galaxies are thought to be formed within dark 
matter halos. However, detailed physical processes of 
galaxy formation, such as star formation and supernova 
feedback, are very complicated. Different prescriptions 
of galaxy formation may lead to different relations be- 
tween luminous galaxies and the underlying dark mat- 
ter, or equivalently, different galaxy biases (e.g., Kaiser 
1984; Davis et al. 1985; Bardeen et al. 1986). Therefore, 
the key to our understanding of galaxy formation is to 
study the galaxy bias both in theories and observations. 
If galaxy formation is mainly determined by local phys- 
ical processes (such as hydrodynamics), the galaxy bias 
is then a constant on large scales (Coles 1993). In the 
first order, the galaxy density fluctuation is proportional 
to that of the dark matter on large scales, S g oc S m . The 
coefficient here is usually called as the linear bias factor, 
bi. It is important to study its dependence on the lumi- 
nosity, morphology, mass, and redshift of galaxies (e.g., 
Xia et al. 1987; Borner et al. 1991; Norberg et al. 2002; 
Jing et al. 2002; Zehavi et al. 2004; Park et al. 2005; Li et 
al. 2006; Meneux et al. 2008), because this information 
provides important clues to how galaxies are formed. 

The bias factor b\ and its relation with the scale k can 
be determined directly from the ratios between the power 
spectra of galaxies (P g (k)) and those of the underlying 
dark matter (P m (fc)), 

Pg(k) = bl(k)P m (k), (1) 

where the higher-order terms are negligible. But it is 
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important to note that the power spectrum of dark mat- 
ter is usually not known in observations. Therefore, the 
absolute value of b\ (k) cannot be obtained through the 
above relation, unless the power spectrum of dark mat- 
ter is measured from observations, such as future weak 
lensing surveys. From Equation (1), we note that the 
bias factor b\ actually couples with the properties of the 
large-scale matter density field, such as the linear rms 
fluctuation, eg. And it is impossible to break such de- 
generacy only by using the galaxy power spectrum. It is 
then challenging to determine the galaxy bias purely in 
observations. 

Fortunately, there is a way of doing this by in- 
volving the second-order statistics, the bispectrum 
B{k\, fc 2 , k 3 ), or more conveniently the reduced bispec- 
trum Q{k\, ki, £3), which is defined by 



Q(k 1 ,k 2 ,k 3 ) 



5(fci,fc 2 ,fc 3 ) 
P(k 1 )P(k 2 ) + eye' 



(2) 



where (fci, fc 2 , ^3) are the three sides of a triangle and 
P(k) is the corresponding power spectrum (galaxy or 
dark matter) . On sufficiently large scales where the den- 
sity fluctuation is small, we can expand the galaxy den- 
sity field S g in the Taylor series of 5 m , which up to the 
second order can be written as 



S„ = bxS„ 



1 



boSl 



(3) 



where b\ is the linear bias factor and 62 is the first non- 
linear bias factor. If we assume that the bias factors 
are deterministic, i.e., constant b\ and 62, we can derive 
the following relation from Equation (3) by utilizing the 
reduced bispectra Q g of galaxies and Q m of dark mat- 
ter(e.g., Fry 1994; Gaztanaga & Frieman 1994; Mo, Jing 
& White 1997), 



,(ki,k 2 ,k 3 ) 



t (fci,fc 2 ,fc3) 



(4) 
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According to the second-order perturbation theory, the 
reduced bispectrum of dark matter Q m (ki, k 2 , ks) can 
be predicted on large scales based on the measurement 
of galaxy power spectrum. In the second-order perturba- 
tion theory (hereafter PT2), it gives (Fry 1984; Matarrese 
et al. 1997; Bernardeau et al. 2002) 

B PT (k 1 ,k 2 , h)=F(k 1 ,k 2 )P L (k 1 )P L (k 2 ) + eye (5) 

where fi = 3f2m 2 ^ 63 /7 denotes the mild dependence on 
cosmology and Ph{k) is the linear dark matter power 
spectrum. It is easy to see that within the tree-level PT, 
the reduced bispectrum Q m (fe,fe,fe) does not depend 
on rcdshift or amplitude of the matter density fluctua- 
tion, but weakly depends on the cosmology, thus depends 
almost solely on the shape of the matter power spec- 
trum. Therefore, the reduced bispectrum Q m (ki, k 2 , k$), 
on large scales, can be predicted from the measurement 
of galaxy power spectrum, since the galaxy power spec- 
trum P g (k) has the same shape as the matter power spec- 
trum P m (k) (they are proportional to each other on large 
scales). The bias factors b\ and b 2 can then be deter- 
mined from the measurement of Q g and the prediction 
of Q m , with at least two independent triangle configura- 
tions from the galaxy distribution. Many more configura- 
tions are actually used in the real measurement, because 
the measurement errors are present. With the value of 
&i, one can determine the amplitude of the linear den- 
sity fluctuation from the galaxy distribution alone, which 
can thus become a potentially powerful way for dark en- 
ergy study if the density fluctuation amplitude can be 
measured in this way for a few redshifts. The values of 
bi and b 2 also contain very important information about 
the formation of galaxies. 

With this method, Verde et al. (2002) measured the 
galaxy bias parameters using the observational data of 
2dF Galaxy Redshift Survey (2dFGRS). They found that 
the linear bias factor b\ is consistent with unity and 
the quadratic bias parameter b 2 is consistent with zero. 
Though the error bars of their measurements are also 
large, the results may still indicate that galaxies follow 
the distribution of dark matter. The range of scales as- 
sumed, however, was in the quasilinear and nonlinear 
regime (0.1 < k < 0.5/iMpc -1 ). On these scales, the 
second-order perturbation theory is already not accurate 
enough to predict the dark matter bispectrum, and even 
the bias expansion (Equation (4)) may have already bro- 
ken down, as we will show below. 

Considering the potential applications of this method 
to constrain the cosmological parameters (e.g., the am- 
plitude of the dark matter density distribution, the equa- 
tion of state of dark energy) from the measurement of b\ , 
and to constrain theories of galaxy formation from mea- 
suring bi and b 2 , it is important to test whether and on 
which scale this method can produce an unbiased mea- 
surement of bi and b 2 . Because the bias factors are gen- 
erally dependent on the scale k (against the assumption 
of scale-invariant b\ and b 2 ) 7 we need to check the range 



TABLE 1 
Simulation parameters 



boxsizc(/i 1 Mpc) 


particles 


realizations 


^particle 


600 


1024 3 


3 


1.5 x 10 10 h- 1 M Q 


1200 


1024 3 


4 


1.2 x W 11 h- 1 M e) 



of validity of Equation (4) and at the same time, we have 
to make sure whether it is consistent with Equation (1). 
We should keep in mind that for this method to be valid, 
the prediction of Q m must be accurate(i.e., PT2 must 
be precise on large scales) and the number density of 
galaxies can be expanded as in Equations (3) and (4). 

In our companion paper(Guo & Jing 2009), we have 
studied the accuracy of PT2 for the bispectrum of dark 
matter and we found that the PT2 prediction is not very 
accurate even on scales k ~ 0.1 /iMpc -1 , and high-order 
loop corrections are needed (e.g., Scoccimarro et al. 1998; 
Bernardeau et al. 2008). In the current paper, we will fo- 
cus on the second point, i.e., on which scale 8 g can be ex- 
panded as in Equation (3). We generate a set of catalogs 
of mock galaxies from a semi-analytical model of galaxy 
formation. Then we measure b\ and b 2 from the galaxy 
reduced bispectrum. The obtained linear bias factor b\ 
is compared with the expected value determined by the 
ratio of power spectra (Equation (1)) in the numerical 
simulations. We will also demonstrate that an accurate 
prediction of the dark matter bispectrum is necessary for 
obtaining correct bias factors. We constrain our discus- 
sion only in the real space (relative to the redshift space) 
to avoid the complicated observational effects. 

The paper is constructed as follows. We describe our 
simulations and galaxy mock catalogs in Section 2. The 
accuracy of determining the galaxy bias factors using bis- 
pectrum is displayed in Section 3. We use the halo model 
to study the influence of different components in Section 
4. We summarize our results in Section 5. 

2. Af-BODY SIMULATIONS 

The cosmology considered here is a canonical spatially 
flat cold dark matter model with the density parame- 
ter Cl m — 0.268, the cosmological constant Ha = 0.732, 
the Hubble constant h = 0.71, and the baryon density 
parameter Hi, = 0.045. The primordial density field is as- 
sumed to be Gaussian with a scale-invariant power spec- 
trum oefc. For the linear power spectrum, we generate 
it with the CMBfast code (Seljak & Zaldarriaga 1996), 
and the normalization is set to as = 0.85, where <7$ is the 
present linear rms density fluctuation within a sphere of 
radius 8 /i _1 Mpc. 

We use an upgraded version of the Particle-Particle- 
Particle-Mesh (P 3 M) code of Jing & Suto (1998, 2002) 
to simulate structure formation in the universe. The 
code has now incorporated the multiple level P 3 M grav- 
ity solver for high-density regions (Jing & Suto 2000). 
In order to have a large mass resolution range, we run 
a total of 7 simulations with 1024 3 particles in different 
simulation boxes, which we hereafter denote by L600 and 
L1200 by different box sizes(Table 1) (Jing et al. 2007). 
The simulations were run on an SGI Altix 350 with 16 
processors with OPENMP parallelization in Shanghai 
Astronomical Observatory. Dark matter halos are iden- 
tified using the standard Fricnds-of- Friends (FOF) algo- 
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rithm with a linking length b equal to 0.2 times the mean 
particle separation. Unbound particles (with positive 
binding energy) are excluded to avoid contamination. 
The different resolutions of the simulations enable us to 
check the consistency among the results from simulations 
of different Lb ox , as well as to investigate the behavior 
of bispectrum over a large dynamical range. Here, we 
choose the Fourier space bin scheme as Alogio(k) = 0.1 
for k < 0.1/iMpc -1 and 0.05 for k > 0.1/iMpc -1 when 
measuring the bispectrum. 

For the galaxy samples, we build our catalogs on the 
semi-analytical model of Croton et al. (2006). They 
have generated the galaxy distribution in the Millennium 
Simulation with the box size Lbox = 500/i _1 Mpc on a 
side(Springel et al. 2005). The cosmological parameters 
used in Croton et al. (2006) (O m = 0.25, ft A = 0.75, 
Ob = 0.045, h = 0.73, as = 0-9) are not exactly the same 
as ours. But the slight difference in the parameters is 
not the focus in this paper, since an approximate relation 
connecting galaxies to dark matter is sufficient here. For 
each dark matter halo in our simulations, we randomly 
choose a corresponding galaxy host halo of the same mass 
from the galaxy catalog of Croton et al. (2006). One 
thing to note is that the virial masses of the halos in 
Croton et al. (2006) are defined by the spherical over- 
density approach, which makes their final halo masses 
about 1.5 times smaller than those of our corresponding 
FOF halos. We adjust the masses of Croton et al. (2006) 
halos, so they would match our halo definitions. After 
the adjustment, we put those galaxies of the host halos of 
Croton et al. (2006) into our corresponding dark matter 
halos, while preserving the same relative positions to the 
halo centers. Finally, we obtain the corresponding dis- 
tributions of galaxies in our simulations, as well as their 
properties, such as the luminosity and velocity. We use 
the r-band magnitude for the luminosity of galaxies, M r , 
which is defined to be M r = M a \, s + 51g/i, that is, the 
absolute magnitude M a b. s when h = 1. 

We show in the top panel of Figure 1 the probability 
distribution of the virial mass m of our simulation dark 
matter halos. L1200 has more massive halos than L600, 
thus L1200 can be used to investigate the distribution of 
more luminous galaxies while L600 for ordinary galax- 
ies with moderate luminosity. The sharp cut for each 
Lbox represents the minimum mass of the dark matter 
halos that consist of at least 10 particles in our defini- 
tion. The host halo mass distributions of the central and 
satellite galaxies in the galaxy catalog of Croton et al. 
(2006) are also shown in the middle and bottom panels 
of Figure 1, respectively. We divide the galaxy samples 
into subsamples of different luminosity, and the proba- 
bility distribution of host halo masses are shown as the 
different lines in the figure. 

To avoid incompleteness, the available galaxy sub- 
samples are actually limited for each Lb ox simulation. As 
it is shown in Figure 1, we can use all those galaxies with 
M r < —20 for L600 (However, the most luminous galax- 
ies with M r < —23 are not abundant in L600, which 
will cause large fluctuations in their bispectrum statis- 
tics. That is why we do not consider this luminosity 
subsample for L600). For L1200, because dark matter 
halos have high mass of to > 10 12 /i _1 M Q , only those 
galaxies of M r < —22 will be analyzed. 




l°g,o m [ h-'M s ] 

Fig. 1. — Probability distribution of the virial mass m of the dark 
matter halos in our simulations (Top). The host halo mass distri- 
butions of the central and satellite galaxies in the galaxy catalog 
of Croton et al. (2006) are also shown in the middle and bottom 
panels. 

3. GALAXY BIASING 

To obtain the galaxy bias factors by fitting Equa- 
tion (4), one needs an accurate error estimation for the 
galaxy bispectrum Q g . Since we do not have enough 
independent Fourier space k modes on large scales of 
k < 0.1 /iMpc -1 (finite- volume effect), and the number of 
realizations for each Lbox simulation are also limited, the 
large fluctuations among different realizations on large 
scales would prevent us from making an accurate error 
estimation of Q g . So we use the Gaussian uncertainties, 
instead of the simulation fluctuations, as the errors for 
Q g . 

The uncertainty of bispectrum for a Gaussian density 
field reads (e.g., Fry & Gaztanaga 1993; Scoccimarro et 
al. 1998, 2004; Sefusatti & Komatsu 2007) 

(AB 2 g ) = J- Ptot(ki)P to t(k2)Ptot(k 3 ) (7) 

^123 

P tot (k)=P g (k) + ±-, ( 8 ) 

where N123 is the number of independent triangle config- 
uration modes in the Fourier space, and Ptot(k) includes 
the shot noise in the galaxy power spectrum for the case 
of N p objects. Although we measure Aq 2 3 directly from 
the simulations, it can be estimated theoretically as (Se- 
fusatti & Komatsu 2007) 

8ir 2 

N123 « - T k 1 k 2 k 3 Ak 1 Ak 2 Ak 3l (9) 

S123K/ 

where kf = 2ir / 'Lbox and S123 = 6,2,1 for equilateral, 
isosceles and general triangles. By assuming that the 
variance of bispectrum dominates over that of the power 
spectrum, variance of the reduced bispectrum is then 
given by, 
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Fig. 2. — Ratios of the reduced bispectrum errors measured from 
simulations to the errors based on the Gaussian field assumption. 
Those points are for galaxies of different luminosity (as shown in 
each panel), while the solid lines represent the ratios for dark mat- 
ter. 

(AB 2 n ) 

^ = iprnp^u 12 - (10) 
J [Pg{ki)Pg(k 2 ) + cyc\ 2 

The fluctuation of Q g is thus directly determined by the 
number of independent triangle modes, -/V123, which are 
actually related to the simulation box size, L^ ox . The 
larger box simulation has higher precision in the deter- 
mination of bispectrum on large scales, especially for 
k < 0.1 /iMpc -1 . In addition to the considerable fluc- 
tuations, the finite-volume effect is also significant for 
the bispectrum on large scales. Because the bispectrum 
essentially reflects the influence of the gravitational in- 
stability, the existence of large-scale structures, such as 
filaments, will significantly affect the final bispectrum 
estimation(e.g., Sefusatti & Scoccimarro 2005). Overall, 
large Lb ox simulations are essential if we want to deter- 
mine the bispectrum on such large scales with unerring 
accuracy, as we have processed in this paper. 

Figure 2 shows the ratios between the errors on the 
mean of Q averaged over the realizations and the cor- 
responding uncertainties based on the Gaussian assump- 
tion. The ratios for galaxies of different luminosity (sym- 
bols) are compared with those ratios for dark matter 
(solid lines) for both L1200 and L600. Given that tri- 
angles with a side in common will correlate with each 
other, we alleviate this problem by defining the scale k 
in Figure 2 to be the longest side of the triangle and by 
including all the triangle configurations in each k bin. 
The remaining correlations are ignored. So, each data 
point in Figure 2 represents the mean of all the triangle 
configurations whose longest sides are in the same k bin, 
and the 1 — a error shown is the fluctuation among the 
different triangle configurations. The small discrepancies 
of dark matter ratios between L1200 and L600 are caused 
by the different simulation resolutions, with L1200 more 



Fig. 3. — x 2 fitting results of Equation (4) for the linear bias 
factor bi(k) in the galaxy samples of L1200. The left panels show 
the fittings with Q m determined from L1200 simulations. In the 
middle panels, Q m is estimated with PT2 using the linear power 
spectrum Ph{k) in Equation (2) and Equation (5). In the right 
panels, Q m is also estimated with PT2, but the nonlinear power 
spectrum Pjvi(fc) estimated directly from the simulation is used 
for the PT2 predictions. The solid lines denote the expected value 
derived from the power spectrum ratios in Equation (1). The scale 
k is defined as the maximum value of the triangle sides (fci, fa, £3). 

reliable on large scales and L600 on small scales. As for 
galaxies of M r > —23, the reduced bispectrum errors can 
be well described by the Gaussian errors on large scales 
of k < 0.1 /iMpc -1 . The galaxies have similar error ra- 
tios to those of dark matter, implying that the deviation 
of the bispectrum error from the Gaussian field error 
is slim in the linear clustering regime. For the galaxy 
samples of M r < — 23, the simulation variances deviate 
rapidly from the Gaussian uncertainties even on large 
scales. This is because these galaxies are highly biased, 
and the non-Gaussianity already plays an important role 
in determining (AQg) for such luminous galaxies. Since 
we are interested in the behavior of the bias factors at 
k < 0.2 ft,Mpc -1 , we will use Equation (10) to estimate 
the errors for the galaxy reduced bispectrum. 

In Figure 3 (£1200) and Figure 4 (£600), we show 
the x 2 fitting results of Equation (4) for the linear bias 
factor bi (k). The left panels show the fitting results with 
Q m measured from simulations, and the middle panels 
show the results with Q m determined by PT2. As in 
observations, we also use the nonlinear power spectrum 
PNh{k) in Equations (2) and (5) for the PT2 estimation 
of Qm-, and the results are shown in the right panels. 
The solid lines denote the predictions derived from the 
power spectrum ratios as in Equation (1). Again, the 
scale k in the figures is defined as the maximum triangle 
size of (fci, ki, k'i). And for each k bin, we have included 
all types of triangle shapes to make the fittings. The 
jump in the errors of L600 at 0.1/iMpc -1 is caused by 
our change of Fourier space bin scheme at this scale. 

In the left panels, the bias factor b\(k) determined 
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Fig. 4. — Same as Figure 3, but for L600 galaxy samples. 

by Equation (4) is well consistent with that of the 
power spectrum ratio (Equation(l)) on large scales of 
k < 0.15 /iMpc -1 . This indicates that b\(k) can be recon- 
structed from bispectrum observations of galaxies with- 
out significant systematic bias. However, when Q m is cal- 
culated from PT2 (the middle panels), the fitted b\{k) are 
overestimated on most scales except for the largest one 
of k ~ 0.03 ft-Mpc -1 where the fluctuation is dramatic. 
The main reason is that PT2 overestimates Q m for col- 
inear configurations (a = arccos(k.i • k2)/feifc2 = and 
7r) and underestimates for triangles of a w 0.67T even on 
large scales of k « 0.1 hMpc^ 1 (Guo & Jing 2009). Using 
the nonlinear power spectrum for the PT2 prediction of 
Qm does not improve the fitting results, as shown by the 
right panels. Our results clearly show that it is neces- 
sary to have an accurate estimation of the dark matter 
bispectrum if Equation (4) is used to get the galaxy bias 
factors. 

When the scale goes down to k > 0.15 /iMpc" 1 , the 
nonlinear effects become important. The linear bias b\(k) 
does not keep a constant on these small scales. Then the 
premise of the fittings that b\ and bi both are scale- 
invariant in Equation (4) breaks down. We would ex- 
pect that these bias factors from the power spectrum 
ratios and from the bispectrum fittings will be different, 
though we note that they agree well on scales down to 
k = 0.4/iMpc -1 for galaxy samples of M r > —23, which 
could be a coincidence. As PT2 breaks down for the pre- 
diction of dark matter bispectrum even on scales slightly 
larger than that of k = 0.1 /iMpc -1 , our results show the 
expansion of the galaxy density fields in Equation (3) is 
valid on the smaller scales (k = 0.15 /iMpc" 1 ). 

To verify the conclusions above, we also show the scale 
dependence of bijb\ in Figure 5 (X1200) and Figure 6 
(X600). We see that bijb\ is nearly a constant on large 
scales of k < 0.15 hMpc^ 1 . With the scale independence 
of bi on these scales, we infer that the nonlinear bias b 2 is 



Fig. 5. — Same as Figure 3, but for the scale dependence of 
62 The solid lines show the case of a vanishing nonlinear bias 
factor &2 for comparison. Since the error bars are much too large 
for k < 0.03 ftMpc -1 in the middle and right panels, we do not 
show those data points here. 
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Fig. 6. — Same as Figure 5, but for L600 simulations. 

also scale invariant for small k. It again confirms the fea- 
sibility of deriving b\ and 62 from Equation (4) through 
direct x 2 fittings. But 62 deviates from such constancy 
as the scale goes to the nonlinear scales, similar to the 
case of the linear bias b\. In general, the nonlinear bias 
62 is nonvanishing on large scales and plays an impor- 
tant role in the determination of the linear bias factor 
b\ when using Equation (4). It is interesting to note 
that the most luminous samples have positive values of 



6 



b 2 (b 2 « 1.2 for M r < -23 and b 2 « 0.3 M r < -22.5) on 
large scales, but fainter samples have negative values of 
b 2 (e.g., 6 2 « -0.2 for -21 < M r < -20 and b 2 ~ -0.3 
for -23 < M r < -22). We note that the values of b 2 /b\ 
for galaxy samples with M r > —22.5 are nearly the same 
with b 2 /b\ = -0.2 at k < O.lftMpc -1 , which might be 
a coincidence for this particular semi-analytical model of 
galaxy formation. The general behavior that b 2 increases 
with the luminosity at the high luminosity end, and de- 
creases with increasing luminosity at fainter luminosity, 
is consistent with the change of nonlinear halo bias b 2 (m) 
with the dark matter halo mass m (Mo, Jing & White 
1997). 

The middle and right panels of Figure 5 and Figure 
6 clearly demonstrate the failure of the fittings for the 
bias factors when using PT2 to calculate Q m . (Since the 
error bars are much too large for k < 0.03 ftMpc -1 , we 
do not show those data points here.) Thus PT2 is ac- 
tually not a good estimator for the reduced dark matter 
bispectrum Q m when we want to use it to fit the bias 
factors with Equation (4). Verde et al. (2002) used PT2 
as the preferred choice for the estimation of Q m and they 
determine the bias factors for k = 0.1 ~ 0.5 hMpc~ . As 
we showed above, on these scales, the PT2 is not accu- 
rate enough(sce Guo & Jing 2009) and the fitting results 
of the bias factors are biased even if the measurement of 
Q g from galaxy surveys is accurate. 

4. AN ATTEMPT TO STUDY THE BISPECTRUM AT 
QUASILINEAR REGIME WITH THE HALO 
OCCUPATION MODEL 

Since the PT2 as well as the Taylor expansion breaks 
down quickly on quasilinear scales (k > 0.1 ft.Mpc _1 ), we 
want to explore if the halo occupation distribution(HOD) 
model of galaxies (e.g., Jing et al. 1998; Berlind & Wein- 
berg 2002; Yang, Mo, & van den Bosch 2003; Zheng et 
al. 2005) can extend the theoretical modeling to quasilin- 
ear and even nonlinear scales. The HOD model has been 
used to predict the bispectrum of galaxies in numerical 
simulations (Jing & Borner 1998; Jing & Borner 2004; 
Nichol ct al. 2006; Kulkarni et al. 2007; Nishimichi ct 
al. 2007; Marin et al. 2008) and in analytical modeling 
(Scoccimarro et al. 2001; Takada & Jain 2003; Wang et 
al. 2004). In this section, we check if the bispectra of 
galaxies on quasilinear scales can be modeled with the 
latter method. 

The necessary halo model ingredients arc described as 
follows. The dark matter halos are defined as objects 
with a mean density A„j r times that of the background 
universe(Bryan & Norman 1998) where A vir w 361 for 
our cosmology parameters, and their density distribu- 
tions follow the Navarro-Frenk-Whitc (NFW) profile 
(Navarro, Frcnk, & White 1997). The concentration 
parameter c(m) of the halos is given by the relation 
c(m) = c (m/m*) /3 , where c = 9, (3 — —0.13, and 
m* = 4.8 x 10 /i _1 M a is the nonlinear mass scale (Bul- 
lock et al. 2001). We also use the same linear power spec- 
trum as that used for generating the initial condition for 
the simulations. For the halo mass function(MF) n(m), 
we consider the analytical models of Press & Schechter 
(1974)(PS) and Sheth & Tormen (1999)(ST). For the 
halo bias parameters bi(m)(i — 1,2), we use the cor- 
responding results of Mo, Jing & White (1997) and 
Sheth & Tormen (1999) for PS and ST mass functions, 



respectively. The last ingredient of the model are the 
different moments of the galaxy distribution within the 
parent halos, i.e., the halo occupation number (N g (m)), 
the second moment (N g (N g — 1)) and the third moment 
(N g (Ng -l)(N g -2)). They are derived directly from the 
galaxy mock catalogs, and are inserted into the integrals 
of the halo model with interpolations of the simulation 
data points. 

Following Scoccimarro et al. (2001), the galaxy power 
spectrum and bispectrum are given by 

P g (k) = [G u (fc)f P L (k) + G 02 (fc, k) (11) 
B g (k u k2,h) = Gu(ki)Gn(k2)Gn(k3)Bp T 

+ [G 11 (fc 1 )G 11 (fc 2 )G 21 (fc 3 )i 3 L(fci)i 3 L(fc 2 ) + eye] 

+ [G 11 (fc 1 )G 12 (fc 2 ,fc 3 ) J PL(fci)+cyc] 

+ G 03 (k 1 ,k 2 ,k 3 ), (12) 

where 



Gij(ki 



,kj) — 



= J dmn(m) 
x [u(ki \m) 



(M(m)) 



bi{m) 



■ u{kj\m)] 



(13) 



and bo = 1. (N J g (m)) represents the different moments of 
the galaxy distribution mentioned above, u(k\m) is the 
normalized Fourier transform of the dark matter halo 
density profile p(r\m) truncated at the virial radius, and 
n g denotes the mean number density of galaxies, 



dmn(m){N g (m)). 



(14) 



We note that {N 3 g (m)) can be further decomposed into 
the central and satellite galaxy components and for the 
central galaxies, u(k\m) = 1. The galaxy reduced bis- 
pectrum Q g is then defined as 



Q g (k 1 ,k 2 ,k 3 ) 



Bgjku k 2 , fc 3 ) 
Pg{k l )P g {k 2 ) + cyc 



(15) 



In Figure 7, we show the ratio of the galaxy power spec- 
trum P g (k) to the linear dark matter power spectrum 
Pl(/c) only for the galaxy samples of —23 < M r < —22. 
The points denote the results of the simulations (the 
shot-noise effect has been corrected), and the different 
lines represent the halo model predictions for different 
halo mass functions. Interestingly, the model predictions 
agree very well with themselves on large scales, indicat- 
ing that the predictions on scales larger than that of 
k = 0.2/iMpc _1 are robust against the changes of the 
mass function and corresponding bias functions. How- 
ever, the halo model predictions are larger than those 
of the simulations by about 16% (from 0.03 /iMpc -1 to 
0.15 /iMpc~ 1 ) on large scales. The wiggles shown on large 
scales of the simulation results are caused by the Bary- 
onic Acoustic Oscillations (BAO). In the halo model con- 
figurations, BAO is indeed embedded in the linear power 
spectrum Ph{k). Then the ratio P g {k)/PL,{k) for the halo 
model will display no trace of oscillation on large scales, 
as shown in Figure 7. Since galaxies as well as the under- 
lying dark matter evolve in a nonlinear pattern, the re- 
sulting nonlinear power spectrum P g (k) would, however, 
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Fig. 7. — Ratio of the galaxy power spectrum P g (k) to the 
linear dark matter power spectrum P/,(fc) for the galaxy samples 
of —23 < Mr < —22. The points denote the results from different 
Li, ox simulations, and the lines represent the analytical predictions 
of the halo models for different halo mass functions. 

still remain some features of BAO, but the nonlinear pro- 
cesses do suppress the BAO features. (Because Ph{k) ap- 
pears in the denominator of the ratio, the peaks shown in 
Figure 7 should actually correspond to the troughs in the 
linear theory.) The differences of the simulation results 
between L1200 and L600 at k > 0.7/iMpc" 1 are due to 
the limited resolution of L1200. 

In halo model, P g {k) consists of two components, the 
1-halo and 2-halo terms. On small scales, the 1-halo 
term which is actually dominant has no dependence on 
the halo bias parameters. So the differences of the halo 
models on the small scales mostly come from the choices 
of different MFs. Our results do favor the ST MF over 
the PS MF as expected. On large scales, the dominant 
component is the 2-halo term and the linear halo bias 
factor &i(m) influences the final predictions. The poor 
agreement between the simulation results and the predic- 
tions of the PS and ST models, implies that the MFs and 
the halo bias functions derived from the peak-background 
split assumption, might not be fully consistent with the 
simulation results. And the similar predictions between 
PS and ST models on large scales also suggest that the 
inconsistency is probably due to intrinsic defects of the 
halo model configurations. The deficiency of halo mod- 
els is also revealed by the fact that the BAO wiggles on 
large scales can not be well described by the halo models. 
Here, we do not consider the exclusion effect and the halo 
boundary effect (see e.g., Takada & Jain 2003; Smith et 
al. 2006), which are important only on the quasilinear 
and nonlinear regimes. 

In Figure 8, we show the halo model predictions of the 
galaxy reduced bispectrum Q g (k, u, a) where k (referring 
to k\ in Q g (k\, ki, k^)) describes the size of triangle, u = 
kijk\, and a = arccos{k\ ■ \z.2)lk\k2- The square and 
triangle open points stand for the simulation results of 



L1200 and L600, respectively. The solid and dotted lines 
represent the halo model predictions for the PS and ST 
MFs, respectively. As in Figure 7, the galaxies analyzed 
are in the luminosity range of —23 < M r < — 22. We also 
show for comparison the dark matter reduced bispectrum 
Q m (solid symbols) and PT2 predictions (dashed lines). 
At first glance, the linear relation of Equation (4) seems 
to be true, confirming our conclusions above. 

On all the scales, the PS model seems to fit the simu- 
lations better, while the ST model predicts a larger Q g . 
The difference in the model predictions might reflect the 
fact that the ST MF has more massive halos than the PS 
MF. On large scales of k < 0.05 /iMpc^ 1 , the predictions 
of PS model are consistent with the simulation results 
fairly well. But the model predictions for the ST MF are 
still larger than those of the simulations except for the 
case of k = 0.03 /iMpc -1 , where the fluctuation among 
the simulation realizations is dramatic. 

On the intermediate scales of 0.06ftMpc _1 ~ 
0.1/iMpc" 1 , the PS model agrees with the simulations 
for isosceles triangle configurations but is larger for the 
collinear configurations while the ST model predictions 
are always larger than the simulation results. On smaller 
scales, Q g from simulations shows a flattening trend 
around the isosceles triangle configurations, as in the bot- 
tom panels of Figure 8. But such a feature is not seen so 
evidently in the halo models. 

As in the case of galaxy power spectrum, the halo 
model is still not accurate enough to fulfil the require- 
ment of precision studies of the galaxy bispectrum. Since 
Q g is the ratio of bispectrum B g to a sum of power spec- 
trum products, any better agreement achieved with the 
PS MF is more likely to be a coincidence, considering 
the poor agreement in the P g (k) predictions. Therefore, 
the framework of the halo model, including its impor- 
tant ingredients, needs to be further digested in future 
studies. 

5. CONCLUSIONS 

We use a set of numerical iV-body simulations to 
study the large scale behavior of the galaxy bias param- 
eters with the bispectrum method. We first determine 
the dark matter distribution from our simulations and 
then construct our mock galaxy catalogs from the semi- 
analytical model of Croton et al. (2006). The galaxy bias 
parameters b\ and 62 can be simply obtained by fitting 
the relation of Equation (4) given the knowledge of the 
dark matter and galaxy bispectra. 

We find that on large scales down to k « 0.15 ftMpc -1 , 
the bias factors b\ and 6 2 are nearly constant. More 
importantly, b\ obtained from the bispectrum method 
is consistent with that from the power spectrum ratio 
(Equation (1)), indicating that the linear bias factor b\ 
can be obtained from the distribution of galaxies. Also in 
general, the nonlinear bias 62 is not zero; it is of a nega- 
tive value for galaxies of typical luminosity, and increases 
to a positive value for the most luminous galaxies. On 
scales of k > 0.15 /iMpc -1 , the bias factors rapidly de- 
viate from the constancy shown on large scales. This is 
because the nonlinear effects become important on these 
quasilinear and nonlinear regimes. On these scales, the 
simple relation between Q g and Q m in Equation (4) or 
the assumption that b\ and 62 are constant is not valid. 
It is therefore not effective to use this relation to obtain 
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Fig. 8. — Scale and shape dependence of the galaxy bispcctrum Q g from the analytical predictions of halo models, compared with the 
results from the simulations for the galaxy samples of —23 < M r < —22. The square and triangle open points stand for the results of 
L1200 and L600, respectively. The solid and dotted lines represent the model predictions for PS and ST MFs, respectively. We also show 
for comparison the dark matter reduced bispectrum Q m (solid symbols) and PT2 predictions (dashed lines). 



the bias factors on small scales. 

We also try PT2 to estimate the dark matter bispec- 
trum in the bias Equation (4). But the poor agreement 
with the known linear bias factor shows that it is very- 
important to accurately estimate the dark matter bis- 
pectrum when using Equation (4). This indicates that 
PT2 is still not accurate enough for the dark matter bis- 
pectrum estimation even on fairly large scales. Thus, it 
again implies that higher-order corrections, such as the 
one-loop correction(Scoccimarro & Frieman 1996) in the 
perturbation theory, are necessary for more precise esti- 
mation of the bispectrum in the further studies. 

Because of the simple structure of the halo model 
method, we also incorporate HOD from our mock galaxy 
catalogs to model the galaxy power spectrum and bis- 
pectrum, as shown in Figure 7 and Figure 8. Overall, 



the analytical halo models are not accurate enough to 
describe the power spectrum and bispectrum of galaxies 
on the requirement of current precision studies. More 
work is needed to improve the analytical halo model. 
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